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SEQUENCE LISTING 

<110> Tomikawa, Mayumi 
Aikawa, Seiichi 
Matsuzawa, Fumiko 
<12 0> Method and Apparatus for Extracting and Evaluating Mutually 
Similar Portions in One-Dimensional Sequences in Molecules and/or 
Three -Dimensional Structures of Molecules 



<130> 


522 .1921D3 


<140> 
<141> 


09/909, 809 
2001-07-23 


<150> 
<151> 


US 08/014,867 
1993-02-08 


<160> 


21 


<170> 




<210> 
<211> 
<212> 
<213> 


1 

37 

PRT 

human 


<400> 


1 


Gly Asp Val Glu Lys 
1 5 



10 15 



His Thr Val Glu Gly Gly Lys His Lys Thr Gly Pro Asn Leu His Gly 
20 25 30 



Leu Phe Gly Arg Lys 
35 



<210> 2 

<211> 39 

<212> PRT 

<213> bacterium 

<400> 2 
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Glu Gly Asp Asp Ala Ala Ala Gly Glu Lys Val Ser Lys Lys Cys Leu 
15 10 15 



Ala Cys His Thr Phe Asp Gin Gly Gly Ala Asn Lys Val Gly Pro Asn 
20 25 30 



Pro Asn Leu Phe Gly Val Phe 
35 



<210> 3 

<211> 51 

<212> PRT 

<213> rat 

<400> 3 



Met Ser Leu Ala lie Leu Arg Val lie Arg Leu Val Arg Val Phe Arg 
15 10 15 



lie Phe Lys Leu Ser Arg His Ser Lys Gly Leu Gin lie Leu Gly Arg 
20 25 30 



Thr Leu Lys Ala Ser Met Arg Glu Leu Gly Leu Leu lie Phe Phe lie 
35 40 45 



Gly Val Val 
50 



<210> 4 
<211> 142 
<212> PRT 
<213> unknown 
<220> 

<223> Figure 23A 
<400> 4 

Thr Glu Glu Gin lie Ala Glu Phe Lys Glu Ala Phe Ser Leu Phe Asp 
15 10 15 
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Lys Asp Gly Asp Gly Thr lie Thr Thr Lys Glu Leu Gly Thr Val Met 
20 25 30 



Arg Ser Leu Gly Gin Asn Pro Thr Glu Ala Glu Leu Gin Asp Met lie 
35 40 45 



Asn Glu Val Asp Ala Asp Gly Asn Gly Thr lie Asp Phe Pro Glu Phe 
50 55 60 



Leu Thr Met Met Ala Arg Lys Met Lys Asp Thr Ser Glu Glu Glu lie 
65 70 75 80 



Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr lie Ser 

85 90 95 



Ala Ala Glu Leu Arg His Val Met Thr Asn Leu Gly Glu Lys Leu Thr 
100 105 110 



Asp Glu Glu Val Asp Glu Met lie Arg Glu Ala Asn lie Asp Gly Asp 
115 120 125 



Gly Gin Val Asn Tyr Glu Glu Phe Val Gin Met Met Thr Ala 
130 135 140 



<210> 5 
<211> 159 
<212> PRT 
< 2 1 3 > unknown 
<220> 

<223> Figure 23B 
<400> 5 

Ala Met Asp Gin Gin Ala Glu Ala Arg Ala Phe Leu Ser Glu Glu Met 
15 10 15 



lie Ala Glu Phe Lys Ala Ala Phe Asp Met Phe Asp Ala Asp Gly Gly 
20 25 30 
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Gly Asp lie Ser Thr Lys Glu Leu Gly Thr Val Met Arg Met Leu Gly 
35 40 45 



Gin Asn Pro Thr Lys Glu Glu Leu Asp Ala lie lie Glu Glu Val Asp 
50 55 60 



Glu Asp Gly Ser Gly Thr lie Asp Phe Glu Glu Phe Leu Val Met Val 
65 70 75 80 



Arg Gin Met Lys Glu Asp Ala Lys Gly Lys Ser Glu Glu Glu Leu Ala 

85 90 95 



Asp Cys Phe Arg lie Phe Asp Lys Asn Ala Asp Gly Phe lie Asp lie 
100 105 110 



Glu Glu Leu Gly Glu lie Leu Arg Ala Thr Gly Glu His Val Thr Glu 
115 120 125 



Glu Asp lie Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly 
130 135 140 



Arg lie Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly Val Gin 
145 150 155 



<210> 6 
<211> 28 
<212>' PRT 
<213> unknown 
<220> 

<223> calmodulin probe site 81-108, Figure 25, target 
<400> 6 

Leu Ala Asp Cys Phe Arg lie Phe Asp Lys Asn Ala Asp Gly Phe lie 
15 10 15 



Asp lie Glu Glu Leu Gly Glu lie Leu Arg Ala Thr 
20 25 
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<210> 7 
<2I1> 28 
<212> PRT 
<213> unknown 
<220> 

<223> calmodulin probe site 81-108, probe, Figure 25 
<400> 7 

lie Arg Glu Ala Phe Arg Val Phe Asp Lys Asp Gly Asn Gly Tyr lie 
15 10 15 



Ser Ala Ala Glu Leu Arg His Val Met Thr Asn Leu 
20 25 



<210> 8 
<211> 27 
<212> PRT 
< 2 1 3 > unknown 
<220> 

<223> Calmodulin probe site 117-143, Figure 26, target 
<400> 8 

lie Glu Asp Leu Met Lys Asp Ser Asp Lys Asn Asn Asp Gly Arg lie 
15 10 15 



Asp Phe Asp Glu Phe Leu Lys Met Met Glu Gly 
20 25 



<210> 9 

<211> 27 

<212> PRT 

< 2 1 3 > unkown 

<400> 9 

Val Asp Glu Met lie Arg Glu Ala Asn lie Asp Gly Asp Gly Gin Val 
15 10 15 
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Asn Tyr Glu Glu Phe Val Gin Met Met Thr Ala 
20 25 



<210> 10 
<211> 8 
<212> PRT 
<213> unknown 
<220> 

<223> Coordinates of C-alpha corresponding to the aminon acid res 
idue N 

OS. 7 to 14 in elongation factor of protein which is a bind 

ing si 

te for phosphoric acid of GTP 
<400> 10 

Gly His Val Asp His Gly Lys Thr 
1 5 



<210> 11 

<211> 8 

<212> PRT 

<213> unknown 
<220> 

<223> adenylate kinase - target 

<400> 11 

Gly Ala Pro Gly Ser Gly Lys Gly 
1 5 



<210> 12 

<211> 8 

<212> PRT 

<213> unknown 
<220> 

<223> ras protein - target 

<400> 12 

Gly Ala Gly Gly Val Gly Lys Ser 
1 5 



<210> 13 
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<211> 8 

<212> PRT 

<213> unknown 
<220> 

<223> adenylate kinase (3ADK) 

<400> 13 

Gly Gly Pro Gly Ser Gly Lys Gly 
1 5 



<210> 14 

<211> 223 

<212> PRT 

<213> unknown 
<220> 

<223> amino acid sequence of trypsin 

<400> 14 



lie Val Gly Gly Tyr Thr Cys Cys Ala Asn Thr Val Pro Tyr Gin Val 
15 10 15 



Ser Leu Asn Ser Gly Tyr His Phe Cys Gly Gly Ser Leu lie Asn Ser 
20 25 30 



Gin Trp Val Val Ser Ala Ala His Cys Tyr Lys Ser Gly lie Gin Val 
35 40 45 



Arg Leu Gly Glu Asp Asn lie Asn Val Val Glu Gly Asn Glu Gin Phe 
50 55 60 



lie Ser Ala Ser Lys Ser lie Val His Pro Ser Tyr Asn Ser Asn Thr 
65 70 75 80 



Leu Asn Asn Asp lie Met Leu lie Lys Leu Lys Ser Ala Ala Ser Leu 

85 90 95 



Asn Ser Arg Val Ala Ser lie Ser Leu Pro Thr Ser Cys Ala Ser Ala 
100 105 110 
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Gly Thr Gin Cys Leu lie Ser Gly Trp Gly Asn Thr Lys Ser Ser Gly 
115 120 125 



Thr Ser Tyr Pro Asp Val Leu Lys Cys Leu Lys Ala Pro lie Leu Ser 
130 135 140 



Asp Ser Ser Cys Lys Ser Ala Tyr Pro Gly Gin lie Thr Ser Asn Met 
145 150 155 150 



Phe Cys Ala Gly Tyr Leu Glu Gly Gly Lys Asp Ser Cys Gin Gly Asp 

165 170 175 



Ser Gly Gly Pro Val Val Cys Ser Gly Lys Leu Gin Gly lie Val Ser 
180 185 190 



Trp Gly Ser Gly Cys Ala Gin Lys Asn Lys Pro Gly Val Tyr Thr Lys 
195 200 205 



Val Cys Asn Tyr Val Ser Trp lie Lys Gin Thr lie Ala Ser Asn 
210 215 220 



<210> 15 
<211> 239 
<212> PRT 
<213> unknown 
<220> 

<223> amino acid sequence of Elastase 
<400> 15 

Val Val Gly Gly Thr Glu Ala Gin Arg Asn Ser Trp Pro Ser Gin lie 
15 10 15 



Ser Leu Gin Tyr Arg Ser Gly Ser Ser Trp Ala His Thr Cys Gly Gly 
20 25 30 



Thr Leu lie Arg Gin Asn Trp Val Met Thr Ala Ala His Cys Val Asp 
35 40 45 
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Arg Glu Leu Thr Phe Arg Val Val Val Gly Glu His Asn Leu Asn Gin 
50 55 60 



Asn Asn Gly Thr Glu Gin Tyr Val Gly Val Gin Lys lie Val Val Pro 
65 70 75 80 



Tyr Trp Asn Thr Asp Asp Val Ala Ala Gly Tyr Asp lie Ala Leu Leu 

85 90 95 



Arg Leu Ala Gin Ser Val Thr Leu Asn Ser Tyr Val Gin Leu Gly Val 
100 105 110 



Leu Pro Arg Ala Gly Thr lie Leu Ala Asn Ser Pro Cys Tyr lie Thr 
115 120 125 



Thr Gly Trp Gly Leu Thr Arg Thr Asn Gly Gin Leu Ala Gin Thr Leu 
130 135 140 



Gin Gin Ala Tyr Leu Pro Thr Val Asp Tyr Ala lie Cys Ser Ser Ser 
145 150 155 160 



Ser Tyr Trp Gly Ser Thr Val Lys Asn Ser Met Val Cys Ala Gly Gly 

165 170 175 



Asp Gly Val Arg Ser Gly Cys Gin Gly Asp Ser Gly Gly Pro Leu His 
180 185 190 



Cys Leu Val Asn Gly Gin Tyr Ala Val His Gly Val Thr Ser Phe Val 
195 200 205 



Ser Arg Leu Gly Cys Asn Val Thr Arg Lys Pro Thr Val Phe Thr Arg 
210 215 220 



Val Ser Ala Tyr lie Ser Trp lie Asn Asn Val lie Ala Ser Asn 
225 230 235 
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<210> 16 

<211> 6 

<212> PRT 

< 2 1 3 > unknown 
<220> 

<223> histidine active sites (41-46) of elastase 

<400> 16 

Met Thr Ala Ala His Cys 

1 5 



<210> 17 

<211> 6 

<212> PRT 

<213> unknown 
<220> 

<223> histidine active sites (36-41) of trypsin 

<400> 17 

Val Ser Ala Ala His Cys 

1 5 



<210> 18 

<211> 5 

<212> PRT 

< 2 1 3 > unknown 
<220> 

<223> serine active sites of elastase 

<400> 18 

Gly Asp Ser Gly Gly 
1 5 



<210> 19 

<211> 5 

<212> PRT 

<213> unknown 
<220> 

<223> serine active sites (175-179) of trypsin 
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<400> 19 

Gly Asp Ser Gly Gly 



<210> 20 
<211> 34 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Figure 46 - example of output protein A 
<400> 20 

Ala Thr Tyr Arg Leu Lys Asn Gly Asp Asp Pro Leu Ser Asp His Lys 
15 10 15 



Leu Ser Asp Phe Gly Phe Ser Gly Lys lie Leu Thr Arg Tyr Asn Gin 
20 25 30 



Ala Val 



<210> 


21 




<211> 


86 




<212> 


PRT 




<213> 


Artificial 


Sequence 


<220> 






<223> 


Figure 4 6 - 


- example 


<400> 


21 





Gly Asp Asp Pro Leu Asp Gly Ala Arg Ser Asp Phe Gly Phe 
15 10 



Ser Gly Lys lie Pro Leu Thr Arg Tyr Asn Val Gin Asn Asp Glu Ser 
15 20 25 30 



Gly Phe Arg Thr Lys His Gly Leu Asp Ala Ser Asn Ala Ala Thr Tyr 

35 40 45 
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Arg Leu Lys Asn Asp Asp Pro Leu Asp Gly Ala Arg Glu Lys Leu Ser 
50 55 60 

Asp Phe Gly Ala His Phe Ser Gly Lys lie Phe Thr Arg Tyr Asn Gin 
65 70 75 

Ala Val Gin Asn Asp Glu Ser Lys 
80 85 
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